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Abstract 

We study techniques for identifying highly boosted top jets, where the subsequent top 
decay products are not isolated. For hadronic boosted tops, we consider variables which 
probe the jet substructure in order to reduce the background from QCD jets with large in- 
variant mass. Substructure variables related to two-body kinematics are least sensitive to the 
modeling of parton shower, while those which involve multi-body kinematics may still have 
discrimination power. For leptonic boosted tops, we consider variables which characterize 
the separation between the lepton — although not isolated by conventional criterion — and the 
hadronic activity in the top jet. Such variables are useful in reducing the backgrounds both 
from heavy-flavor jets and from accidental jet-lepton overlap. We give numerical estimates 
of the top identification efficiency versus background rejection rate as a functions of cuts 
on these variables, and find that these variables offer additional useful information above 
invariant mass alone. 
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1 Motivation 

With over 10,000 top events at the Tevatron, CDF and DO have begun to study the detailed 
properties and couplings of the top quark [HE]. At the upcoming Large Hadron Collider (LHC), 
the pair production cross section for top quarks approaches 1 nb [3J, so with an estimated 100 
million top events per year at peak luminosity, ATLAS and CMS will be able to pursue a program 
of precision top physics [H [5] . 

Given the large top Yukawa coupling, it is natural to expect physics beyond the standard 
model (BSM) to have strong couplings to the top sector. Most solutions to the hierarchy problem 
require some kind of top partner 13 El [9] , and if that top partner is kinematically assessable at 
the LHC, then its decay products are expected to contain top quarks [IHl El ttSJ 13] . Similarly, 
if the top quark is a semi-composite state [11], then new resonances in the composite sector are 
likely to have large branching fractions to top quarks [T5j. 

At first glance, the near 100% branching ratio t bW implies that any new physics that 
creates top quarks will be seen in events with multiple heavy-flavor-tagged jets and leptons. 
These exciting event topologies are somewhat challenging to reconstruct unambiguously, but 
in principle one can use standard methods [161 HJ |5] to identify top quarks coming from new 
physics. Indeed, tops produced nearly at rest in the lab frame have three isolated decay products 
which, apart from neutrinos and accidental overlap, can be separately identified. So assuming 
one can isolate a top-rich sample with well-identified decay products, the main challenge would 
be to separate a top-rich BSM signal from the SM top production background. 

On the other hand, it is possible for a top-rich BSM signal to not have isolated decay 
products. If the top is produced with a considerable boost relative to the lab frame, then the 
three decay products from the top will end up in the same part of the detector and possibly be 
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Figure 1: Distribution of the jet invariant mass of the hardest jet in QCD dijet production, 
for pT > 400, 800, 1200, and 1600 GeV. The average jet mass increases with jet pr roughly as 
10% X pt- In this and the remaining figures: the QCD production and showering are simulated 
using Pythia [T7] (in this plot Pythia 8. 108); Fast Jet 2.3.3 [18j is used for jet analysis; jets 
are identified using the anti-fcr algorithm [19] with Rq = 1.0; the distributions are normalized 
to 1 to accentuate shape variations. 

clustered into a single fat jet. This occurs for example with multi-TeV resonances that decay to 
tops [15], i.e. X —f tt, where the boost factor 7t ~ mx/{2mt) can be large. 

One background to these fat jets comes simply from high-p^ top events. Less obvious is 
that high-pT QCD events also can yield fat jets. As shown in Fig. [H the average invariant mass 
of a jet is around 10% (~ ag) of its energyj^ so once QCD jets have ~ 1 TeV, the average 
jet invariant mass approaches the top mass. Said another way, the kinematics of top decays 
and the kinematics of QCD radiation start to look similar when 7^ ~ l/a^. Depending on the 
performance of heavy-flavor tagging at high-py, one also worries about fat flavor-tagged jets, 
which could come from high-p^ bottom or charm production. 

In this paper, we try to increase the purity for boosted top identification by finding variables 
that discriminate between boosted top jets and ordinary QCD fat jets. In Section EJ we consider 
boosted hadronic tops and study the energy asymmetry inside the jet. Because QCD splitting 
functions [20] have soft singularities, it is likely for a jet to have two subclusters with disparate 
energy scales. In contrast, an on-shell top decay yields a W and a b with roughly comparable 
energies. Energy asymmetry turns out to be the unique fat jet observable one can construct 
if one approximates the jet substructure by two-body kinematics, and it offers good separation 
between boosted tops from QCD fat jets. This variable is correlated with the dcut measure 
used in preliminary boosted top studies at ATLAS [2LJ . In Section [3l we examine the prospects 
for identifying top jets through multi-body signatures. While the theoretical understanding of 
multi-body jet substructure is less controlled, we find that by identifying the hadronic W within 
a boosted top, one can obtain complementary information to energy asymmetry. Finally, in 
Section S) we look at boosted leptonic tops and study the kinematics of the muon (or possibly 
electron) coming from the W decay. This muon is not isolated when the the top is boosted, but 
the muon spectrum does discriminate between a boosted leptonic top, a bottom jet with a soft 
muon tag, and accidental jet-lepton overlap. 

^Depending on exactly what one means by "average invariant mass", it may be more correct to say mjot ~ 
\/as log{E/Eo), where Eo is some reference energy. 
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The expected effectiveness of these variables depends cruciaUy on the modehng the high pT 
tail of the QCD background. In the context of a Monte Carlo simulation, to build up 170 GeV 
of radiation in a jet requires multiple QCD emissions. While the parton shower approximation 
is appropriate for describing emissions in the soft-collinear limit, the jet mass and the jet sub- 
structure depends crucially on how exactly the Monte Carlo program treats the kinematics of 
successive emissions. In this study, we focus on variables which are expected to be least sensi- 
tive to the details of the parton showering model, and we study model variance by using several 
showering algorithms. Ultimately, any fat jet variable will have to be tested on data and Monte 
Carlos tuned appropriately. 

Our study focuses on the differences between QCD fat jets and boosted top jets without 
regard to any other features of the event. In most cases, the estimated top tagging efficiencies 
from this paper can be applied directly on top of a standard event selection criteria. However, 
this does mean that we will not discuss the absolute normalization of the background. One reason 
for this choice is that there are theoretical uncertainties in the size of the QCD jet invariant 
mass tail [22], so it is unknown precisely how much of the QCD dijet background enters the top 
mass window. Another reason is that depending on the specific BSM signal one is interested in, 
one would apply additional event selection criteria, and these criteria would dominantly affect 
the normalization of background and only secondarily the spectrum of the hardest fat jet. On 
the other hand, in many interesting new physics processes, boosted tops are pair produced, so 
there could be additional useful variables that correlate the two top quarks. Indeed, the QCD 
background to two fat jets will be different from the background to one fat jet. A full study of 
a boosted tt signal is beyond the scope of this paper and will be left to a future work. 



2 Top Decays vs. QCD Radiation 

2.1 Jet Substructure Strategy 

When a boosted tops decays hadronically, there are no large sources of missing energy, so one 
can identify boosted top jets by finding jets with invariant mass ~ m^. Still, ordinary QCD 
jets can have large invariant mass, so one would like to extract some information about the 
substructure of the jet in order to better distinguish a top decay t bW from a fat jet formed 
from QCD radiation. 

In the parton shower approximation [23], the evolution of a QCD jet is described by a 
sequence of 1 — > 2 splittings, such as the one shown in Fig. [2j After averaging over the azimuthal 
angle, there are only two variables that characterize the splitting M AB. One of them is the 
invariant mass (or virtuality) of the mother particle Qf^. A typical and convenient choice for 
the other variable is the energy sharing 

mm{EA,EB) 

ZM = ^ • (1) 

All other choices for the two splitting variables are equivalent to {Q|^, zm} up to an ambiguity 
as to how to treat the invariant mass of the daughters A and B, which we discuss in more 
detail below. For the remainder of this section, we will only look at variables that can be 
characterized in terms of an M — > AB splitting, leaving a discussion of the theoretical challenges 
and experimental prospects for multi-body signatures to Section [3j 

Intuitively, differences between a QCD jet and a boosted top jet should only be significant 
during the early history of the shower, and the first splitting should be a "soft" splitting in the 
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Figure 2: A schematic diagram of a 1 — > 2 process as one stage of a parton shower history. For our 
study, the mother parton M is assumed to initiate a single jet, and the first sphtting M — > AB 
determines the dominant kinematic features of that jet. After integrating over azimuth, the 
first sphtting it is determined by the jet invariant mass Qm and the energy sharing zm = 
Tmi].{EA, Eb) / Em ■ Because of a theoretical ambiguity as to how to interpret the daughter 
invariant masses Qa and Qb-, there are multiple zm variants, each one giving a potential jet 
substructure observable. 

case of QCD jet and a"hard" splitting in the case of the top jet. In AppendixlAj we review two 
theoretical distributions for zm- The narrow width approximation is a rough proxy for boosted 
tops, particularly for the first two stages of the prompt decay t bW and W — > qq' . The QCD 
parton shower gives a reasonable approximation to QCD fat jets, especially if we only look at 
the first splitting. In terms of {Q\i,zm}, the differential decay distributions for the splitting 
M AB take roughly the form 

dfM^AB ~ (^Oii dzM ^{QIi - m\j), ruM = mt or mw (2) 

dfu^^AB ~ dQli dzM -^P{zm), (3) 

where we have suppressed all non-singular functions. P{zm) is an Alterelli-Parisi splitting 
function [20] that typically scales like 1/zm- 

Comparing df^^^ and d/^^^, the most obvious difference is the spectrum in Qm, and as 
we already said, to distinguish a top jet from a QCD fat jet one first wants isolate a sample of 
jets with invariant mass close to mt- But we see that the spectrum in energy sharing variable 
zm is also different between top jets and QCD fat jets because of the soft singularity of QCD. 
In particular, df^'^^ peaks towards zm ^ in QCD, while df^^^ has its maximum at some 
finite value of zm- If one can find an experimental observable that is correlated with zm, then 
one would have an additional discriminating handle beyond just jet invariant mass. 

In general, it will be difficult to find an experimental observable perfectly correlated with 
a parton shower history. In order to define the energy sharing zm, one has to reconstruct the 
M — > AB splitting, but we of course do not actually observe the showering process. However, 
recursive jet clustering algorithms [231 [251 [261 [271 [281 [19] are designed such that the jet clustering 
procedure approximately reverses the shower history. Given a set of particles that forms a jet 
M, one can apply a clustering algorithm until there are exactly two subclusters A and B^ As 

^It is not necessary to use the same algorithm to define A and B as one used to find A4. In this paper, we will 
in fact use two different procedures, anti-^T for jet finding and kr for subcluster finding. 
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long as the clustering procedure captures the leading singularities of the subclusters A 

and B will be reasonable proxies for the partons that participated in the first QCD splitting, 
and one can define zm accordingly. 

Of course, even with a perfect clustering algorithm, the experimentally identified subclusters 
A and B will not have a one-to-one relation to the theoretical splitting M AB. As an 
extreme example, even if A and B can be separately isolated in the rest frame of M, if the 
boost axis of the system is aligned with the momentum axis of A and B, then A and B will be 
perfectly overlapping and no experimental observable can disentangle them. In addition, finite 
calorimetry means that the QCD soft singularity is in effect regulated, so the experimental zm 
distribution will always vanish as zm — > 0. The hope is that the experimental and theoretical 
definitions of M — > AB will agree some fraction of the time, so any experimental proxy for any 
ZAf-like variable will offer some degree of discrimination power between top jets and QCD fat 
jets. 

Finally, a comment about the criteria we use for identifying jets. In Section 12.31 we will 
define jets using a jet radius R which is larger than the values typically considered by ATLAS 
or CMS, and one might worry that we could obtain more information by using a stricter jet 
selection criteria. For finding boosted tops, though, choosing a smaller value of R only shifts 
the problem from tagging single jets with Q nit to tagging pairs or trios of jets with Q ~ m^. 
That is, even if on a lego plot it is "obvious" that a boosted top is composed of three objects, 
QCD fragmentation also has a finite probability of yielding three objects with Q ^ mt- Whether 
one talks about jet substructure on a fat jet or interjet kinematics among smaller jets, one is 
faced with the same problem of trying to determine the identity of a set of objects with Q ~ m^. 
That said, the value of R can certainly be optimized. 

2.2 Choice of z Variable 

The ideal choice for the z variable is one that is completely uncorrelated with jet invariant 
mass for QCD jets. In that case, one could imagine doing a sideband analysis to determine the 
expected QCD z distribution away from ~ and then extrapolating to the top jet region. 
In Appendix we argue that in the leading logarithmic approximation, the variable that is 
most uncorrelated with invariant mass is the zm variable that appears in the splitting function 
P{zm)- That is, up to subleading corrections, the precise functional form of df^'^^ in Eq. ([26]) 
can be written as the product of a Q|^-dependent piece and a ZM-dependent piece. 

This pseudo-factorization still does not tell us which definition of zm to use. In order for 
the Alterelli-Parisi splitting function P(zm) to make sense, all that is required is that zm 
mm(Eyi, Eb) / Em in the limit that M, A, and B are exactly collinear and all on-shell. Given 
subcluster four vectors pA and p b , there are many functions zm {pa ,Pb) that satisfy this property, 
and they differ only by how the virtualities Q\ and of the daughters are treated. 

In the context of a Monte Carlo simulation, each function zm{pa,Pb) corresponds to different 
assumptions in the parton shower for how to treat successive emissions. This is referred to as 
momentum reshuffling [23] and occurs because the splitting M — > AB with on-shell partons A 
and B has to be modified when A and B themselves split and go off-shell. The ideal choice 
for z is one that is uncorrelated with Qj^j after taking into account the spectra in and 
from subsequent splittings, but unfortunately, the information on the correct interpretation of 

^In order to define zm, it is important that the clustering scheme is "soft-sensitive". For example, the re- 
cently proposed anti-fcr algorithm 19 performs clustering in a way that washes out information about the 
soft-singularity. Similarly, the distance measure used in the Cambridge/ Aachen algorithm 28 also does not 
capture the soft singularity of QCD. 
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Q\ and is not contained in Eq. ([26]) and requires further insight from QCD. We discuss this 
issue more in Appendix El 

Given this ambiguity for how to define z, we present three plausible choices. The simplest 
experimental observable correlated with z is 

_ mm{EA,EB) _ ^ , , 



where Ei are the energies of the calorimeter cells and the subclusters A and B are determined 
using some clustering scheme. This definition of z trivially satisfies the property that it asymp- 
totes to m.m{EA,EB)/EM in the exact collinear limit, though of course in the exact coUinear 
limit, it is impossible to experimentally separate a jet into subclusters A and 

While simple, the above definition of z is by no means unique. A more complicated choice 
for a variable correlated with z is the dcut variable used in the /ct jet clustering algorithm |27j . 
It has been used in the study of highly boosted W and Higgs jets [29] as well as for boosted 
tops [21]. In terms of the splitting M — > AB, dcut is equivalent to 

dcut = mm{p'^A,PTB)^RAB^ ^Rab = (</'A - (pBf + {r]A - rjsf, (5) 

where pTi, <j)i, and r/j, are the transverse momenta, azimuth, and pseudo-rapidity of A and B in 
the lab frame. The combination 

^ ^ dcut _^ inm{EA,EB) 
dcut + Qm + Eb 

in the limit that A and B are collinear and massless, so Zcut is a valid z-like variable. 

One nice theoretical property of using Zcut as a discriminating variable compared to dcut 
directly, is that dcut measures both the soft {z 0) and collinear {Q^ —> 0) singularities of QCD, 
whereas Zcut only captures the soft singularity. Therefore, we expect the Zcut-dependence to 
pseudo-factorize from the Q^-dependence, while dcut is expected to be more correlated with Q^. 
Of course, once one isolates a sample of jets whose invariant mass is near a fixed value mt, Zcut and 
dcut perform nearly identically as discriminating variables for boosted tops. Experimentally, the 
reason to choose Zcut over dcut would be if it is indeed more orthogonal to Q^, both in spectrum 
and in systematics. 

There are Lorentz-invariant z variables such as 

min(pref • PA, Pref ' Pb) 

^lA = 7 \ ^ , (7) 

Pref • [PA+Pb] 

where pref is any reference four vector. It is easy to check that when pA and ps are massless 
and collinear, this yields a valid z-like variable as long as p^ei ■ [pA + Pb) is not exactly zero. 
An obvious choice for pref is Pa + itself, but a downside to that choice is then zli can only 
go to zero when ruA or uib goes to zero, so for typical fat QCD jets with many hadrons, zli 
never sees the QCD soft singularity. In the next section, we will take Pi-ef to be one of the 
proton beams {7 TeV, 0, 0, 7 TeV}|j though this choice is not particularly well-motivated since 



*The tension between a theoretical z that has to asymptote to a given value in the soft-coUinear limit and 
an experimental z that has to know about the A/B distinction means that z cannot be defined using strictly 
coUinear-safe observables. Said another way, to calculate the QCD z spectrum in perturbative theory, one would 
need to consider two partons per fat jet. 

^Of course, the choice of the magnitude of prd does not affect our results. 
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it violates the reflection symmetry of the experiment. Note that ^ceii is equivalent to choosing 
p,ef = {14 TeV, 0,0,0}. 

We emphasize that in the context of Eq. ([3|), ^ceih -^cut, -^lI) and all other z-like variables are 
completely equivalent. Because there is no preferred treatment of the virtualities Q\ and Q\ at 
the leading logarithmic level, we cannot say whether z should be a Lorentz-invariant function, 
or just a function of energy, or just a function of three-vectors. We also emphasize that one 
cannot decide which is the best definition of z from a showering Monte Carlo alone, because one 
would be sensitive to the momentum reshuffling assumptions of the algorithm. 

2.3 Study of z Variable 

We now study the z variables discussed in the previous section. Hard scattering, resonance decay, 
parton showering, and hadronization are performed with various versions of Pythia [iTjlfl We 
have used the Fast Jet 2.3.3 [18] package to cluster and analyze the jets. No smearing of the 
jet or lepton energies have been included in this study. Because we will be using a clustering 
scheme to define observables, we have approximated the effect of detector resolution by including 
finite calorimeter segmentation. Hadrons are binned into (50 x (5// = 0.1 x 0.1 segments, and each 
segment is treated as a massless four- vector whose energy is determined by the hadronic energy 
deposit. The calorimeter extends to \r]\ < 3.0, and no charged track information is used. We 
will focus on the difference in the shapes of various observables, therefore all of the kinematical 
distributions from different processes are normalized to 1. 

In all the analysis, we only use information about the hardest jet, and to find that jet we 
use the anti-/cT algorithm [19], which gives nearly conical jets, with Rq = 1.0. As mentioned 
already, this value of Rq is larger than the typical choices of 0.7 or 0.5, but is justified for the 
purposes of studying fat jets. In a more sophisticated analysis, one would choose different values 
of Rq depending on the expected px spectrum of the boosted top in order to track the degree of 
collimation of the fat jet. In order to define the z variable, we have to identify the subclusters 
A and B. Taking only the calorimeter cells that formed the hardest jet, we recluster with the 
exclusive algorithm [27] to force the formation of exactly 2 jets. This defines the full splitting 
M AB from which we can define any z-like variable. Note that the jet finding algorithm is 
not the same as the jet substructure algorithm. 

The source of highly boosted top jets in our study is a heavy resonance with a open decay 
mode X — > ti. Because we only care about the z variable shape within a single jet, the total 
cross section and angular correlations for X production is irrelevant, though for definiteness we 
use a spin-1 sequential Z' pO], which is a Pythia 8. 108 default. The main effect of this choice 
is to select the PDFs used for production, which affects the rj distribution of the boosted tops. 
We use three benchmark X masses, and corresponding cuts on the hardest jet pT'- 



Resonance Mass 


Pt Cut 


2 TeV 


800 GeV 


3 TeV 


1200 GeV 


4 TeV 


1600 GeV 



(8) 



For the QCD dijet background, we use three different versions of Pythia to estimate typical 

®For simplicity, we have not considered underlying event or pile-up, though these effects are expected to degrade 
the performance of any jet-based measurement. 
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Figure 3: Two dimensional distributions of Zceii (horizontal axis) versus jet mass Qjct (vertical 
axis) for the hardest jet with a 1200 GeV pr cut. Left: boosted top jets from a 3 TeV resonance. 
The top quark resonance structure is visible as a dense horizontal band around Qjet ~ "mt- 
Right: QCD dijet production. The infrared singularity is visible as z — > 0, though this feature is 
somewhat washed out at large Qjet- The intensity of the shading is proportional to the logarithm 
of the occupancy. 



variations among parton shower algorithms: 



Name 


Description 


Pythia 
Pythia 
Pythia 


8 [pt) 
6 {pt) 
6 (Q) 


Pythia 8 
Pythia 6 
Pythia 6 


108 with the default p^-ordered shower 

409 with the "new" pT^-ordered shower (PYEVNW) 

409 with the "old" Q-ordered shower (PYEVNT) 



(9) 



We only consider light quark {uds) and gluon dijet production, though the inclusion of heavy- 
flavor ich) does not dramatically change the distributions. Though SM ti production is an 
important background to X ^ tt, we consider high-p-r tops to be part of the signal for this 
study, and therefore do not include its contribution since the ratio between SM ti and BSM ti 
production is unknown. Because we are looking only at the hardest jet for analysis, the QCD 
background considered here is a reasonable proxy for any background with the jet pT cut from 
Eq. (ED. 

In Fig. [3l we show two-dimension distributions for {Qjeti -Zceii} foi' the 3 TeV top resonance 
and QCD dijets, both with a 1200 GeV px cut on the hardest jet. The top resonance near 170 
GeV is visible in the signal sample, and the collinear and soft singularity near Q^, z ^ is visible 
in the QCD sample. If the clustering procedure were to correctly identify the t bW decay, 
then the expected z distribution from Eq. ([2]) would be nearly flat for the boosted top sample. 
Instead, we see that finite detector size and decay product overlap yields a Zceii distribution 
which falls off approximately linearly towards z^en 0. In the QCD sample, the predicted 1/z 
behavior from Eq. ([3]) folded with the linear ZceW suppression flattens the Zcew distribution near 
Qjet = 170 GeV, those the QCD soft singularity is still observable. 

To isolate a boosted top sample, we first want to impose a jet invariant mass cut around 
top mass window Q^in < Qjet < Q max- Because we are using a largish value of Rq, the top 
decay products are almost always clustered into a single jet, and choosing a fixed lower bound 
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Figure 4: Signal efficiency versus background rejection using a top mass window selection criteria 
160 GeV < Qjet < Qmax with varying Qmax- The background is QCD dijets from Pythia 8 . 108, 
and the signals and jet px cuts are given in Eq. ([5]). In this and the remaining efficiency vs. 
rejection plots, rejection r is defined as the fraction 1/r of the background remaining after the 
cut. The dots indicate the fiducial value Qmax = 200 GeV. 

of Qmin = 160 GeV only eliminates around 20% of the signal. The consequence of using such 
a large jet area is that additional hadronic activity from initial state radiation can enter the 
jet, so the signal has a long tail to larger values of Qjet- In Fig- HI 

we vary the value of (^^max 

and plot the rejection of the QCD background relative to the signal efficiency. For comparison 
with subsequent studies where we include additional variables, we choose a fiducial value of 
Qmax = 200 GeV, labelled by the dots in Fig. [H Amusingly, this value of Qmax gives the 
same 90% background rejection for all three benchmark pj- cuts, which can be understood from 
Fig. [T] because the normalized Qjet distributions intersect near 170 GeV. As the resonance mass 
increases, the signal efficiency decreases, which is expected because contamination from initial 
state radiation is a bigger effect at larger parton collision energy, pushing the invariant mass of 
the top jet out of the top mass window. We expect that in a realistic study, one would optimize 
Qmin and Qmax in conjunction with Rq. 

Having identified the top mass window 160 GeV < Qjet < 200 GeV, we can use the z- 
variables to further purify the sample. We demonstrate typical z variable distributions in Fig. O 
again for the 3 TeV resonance benchmark. The jet invariant mass is shown in the upper left 
panel, and while one might be able to improve signal to background by tightening the top 
window criteria, invariant mass alone probably will not be completely effective in discriminating 
boosted top jets from QCD jets. The z-like variables discussed in the previous section, Zceib -^^cut 
and zli, all show a shape difference between boosted tops and QCD jets, with the QCD soft 
singularity clearly active. In Fig. [6l we assess the additional discrimination power offered by 
the z variable, and for a modest decrease in signal efficiency, one can increase the background 
rejection by a factor of 2 to 5 compared to an invariant mass cut alon. Instead of fixing the 
mass window, it is possible to achieve better results by optimizing (^max 

and ZccU together. For 

example, from the right panel of Fig. [U we see that in the case of 4 TeV resonance, the point 
with fiducial value Qmax = 200 GeV is somewhat below the curve from combined optimization. 
This is fairly common in other cases where we combine several variables. 

The estimate of the QCD background shape depends on the modeling of the parton shower 
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Figure 5: Fat jet spectra of the hardest jet after applying a top mass window 160 GeV < Qjct < 
200 GeV cut, comparing a 3 TeV top resonance with the QCD dijet background, both with a 
1200 GeV pt cut. From left to right, top to bottom: jet mass Qjeti -S-cell) ^Icut) 

and zli- All 

three Pythia versions give similar spectra, validating our assumption that z variables should 
be relatively insensitive to showering assumptions. The dijet background peaks at z = 0, while 
the boosted top signal falls off linearly with z, so a z cut will provide additional discrimination 
power beyond jet mass alone. 
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Figure 6: Signal efficiency versus background rejection using various z variables. Left: efficiency 
vs. rejection relative to the 160 GeV < Qjot < 200 GeV top window cut. The dots indicate the 
fiducial value {Qmax, -Zceii} = {200 GeV, 0.2}. Right: optimizing the Qmax and Zceii cuts. This 
plot is directly comparable to Fig. [H 
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Figure 7: Evaluating the potential for a sideband study of z variables on QCD dijet production. 
While the strength of the soft singularity of QCD changes with different jet invariant mass cuts, 
the variation is modest, suggesting that the z spectrum can be measured at lower Qjet and 
extrapolated to the top mass window. Left to right: Zceii and Zcut- 
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process, so it is encouraging that in Fig. [5l the three different showering algorithms give roughly 
the same z distributions. This suggests that the structure of the soft singularity is relatively 
insensitive to the showering assumptions, though this insensitivity is somewhat artificial since 
all three versions of Pythia use similar momentum reshuffling procedures, and therefore have 
the same preferred notion of z. In Pythia 6 (Q), one can change the interpretation of z in the 
shower using the MSTP(43) option, but we found that these variations were no larger than the 
variations between different Pythia versions. 

A realistic evaluation and optimization of the z-variable should be performed once we can 
study real QCD jets at the LHC. It is important to be able to perform such a study in a "side- 
band" , i.e. in a kinematical regime where the presence of top jets does not distort the infrared 
structure of QCD jets In Fig. m we show the z-variable distribution for QCD jets for different 
jet mass windows. The variation in the ^-distributions are mild across a wide range of jet masses, 
with the biggest difference being the peak shape near 2: — > 0, suggesting that a side-band study 
should be feasible. 

3 Multi-Body Prospects 
3.1 Theoretical Challenges 

In the previous section, we looked at variables that approximated jet fragmentation as a simple 
two-body decay. Because we used a parton shower Monte Carlo to generate events, and because 
the parton shower is defined by a sequence of 1 ^ 2 splittings, we argued that the variables 
most likely to be correctly described by the Monte Carlo are ones that only involve a single 
1 — > 2 split. Given the momentum reshuffling ambiguity as to how to treat successive emissions 
in the parton shower, we wanted our analysis to be as independent of subsequent splittings as 
possible. 

Because an M ^ AB splitting is characterized completely by Q\,i and zm-, many seemingly 
"better" observables are really equivalent to the analysis of Section [2] with a different choice of z 
variable. We already saw that there was a Zcut variable that captured the physics of a dcut rnea- 
surement. Similarly, an energy "dipole" variable is just a different way of phrasing {Q%j,zm}- 
In principle, one could gain leverage by using information about the daughter virtualities, but 
already the different choices for z differ precisely by the treatment of Q\ and Q^, so some of 
the daughter information is already contained in z. The only way to dramatically improve an 
analysis based on M ^ AB kinematics is to find experimental observables that do a better job 
correctly identifying the relevant subclusters A and B. 

Given the limitations of two-body observables, one would like to look at multi-body signa- 
tures. Indeed, the decay of a top quark looks less like a 1 ^ 2 splitting and more like a 1 3 
splitting, so one should be able to use multi-body information to help distinguish boosted tops 
from QCD jets. Such variables are not very well understood in the parton shower approximation, 
but in principle one could extend Eq. ([3]) to correctly handle multiple QCD emissions. Training 
such variables at the LHC with QCD jets in some proper control region will certainly provide 
useful information. With multi-body information about QCD, one can consider more elaborate 
jet substructure observables, and we will look at two specific kinds in this section. 

^Away from Qjct ~ mt, there could be contamination from the signal if the top was far off-shell or if the 
supposed "top jet" only contained part of the top quark decay products. Because z distribution for a boosted top 
will be determined by a hard prompt decay, we expect the signal to be less sensitive to the theoretical uncertainties 
associated with parton shower modeling. Therefore any signal contamination effects could be estimated from 
Monte Carlo. 
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Figure 8: Left: QCD dijet predictions for det S with a px cut of 1200 GeV, showing large 
variations between different shower evolution variables. Right: det S'^ after imposing the top 
window cut 160 GeV < Qjet < 200 GeV, comparing to a 3 TeV top resonance. While det 5-*- 
shows promise in separating boosted tops from QCD fat jets, it is difficult to make a firm 
conclusion given the large theoretical variance. 



3.2 Boost-Invariant Event Shape 

While boosted tops might be described theoretically by an M ^ ABC splitting, one still has 
to find an experimental proxy for the A, B, and C subclusters. Instead of using a clustering 
algorithm, an alternative strategy is to construct an event shape variable that uses all of the 
hadrons in a jet to form an observable that measures the gross energy distribution. 

The goal is to build an event shape that probes the fact the top decay products are widely 
separated in the top rest frame, so one wants a boost-invariant event shape. Ideally, the event 
shape would be invariant under both the boost axis and the boost magnitude. Unfortunately, 
building a meaningful event shape that is invariant under choice of boost axis is difficult, because 
in the M rest frame, the splitting M — > ABC defines a plane. If the boost axis is perpendicular 
to this plane then A, B, and C look well-separated, but if the boost axis is parallel to the plane, 
then A, B, and C overlap. 

We can still form an event shape that is invariant under the boost magnitude, by considering 
a variant to the ordinary sphericity tensor ^l]!! Taking the z-axis to be the boost direction, 
consider a jet with total four vector {E'jgt, 0^,Pj^et} ™d constituents pa = {Ea,Pa,Pa}- The 
(linear) jet transverse sphericity tensor S^^^ is an object that is invariant under boosts along 
the z-axis: 



E 



-Li -^i 
Pa Pa 



S^'^ = "^^^ * . (10) 

oSjet 

There is only one non-trivial eigenvalue of since the two eigenvalues sum to 1, so we will take 
the determinant of to be our boost-invariant event shape. Note that det is identically 



Strictly speaking, even this event shape is not invariant under boosts given finite calorimetry. Even though 
is invariant under boosts, the calorimetry is defined by (f) and r), which is invariant only under boosts along 
the beam axis and not to boosts along the top momentum axis. 
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in the two-body limit, so det S does not have a z-hke interpretation and is a true multi-body 
distribution. It is easy to check that < detS""*" < 0.25. 

One can think of det S-^ as measuring the degree of transverse phase space democracy within 
a jet. Boosted tops are peaked at large values of det S-^ because when the boost axis is perpen- 
dicular to the ABC plane, the three decay products fill the transverse plane. QCD jets peak 
at smaller values of det S'^ because filling the transverse plane would require two large kicks 
relative to the jet axis, but this is suppressed by the splitting functions. 

While det S'^ is a nice event shape in principle, it is not surprising that the theoretical 
distribution is not well understood in the parton shower picture. In the left panel of Fig. [8] we 
show the det distribution for the three different versions of Pythia from Eq. ([9]), and one 
can see that depending on the choice of evolution variable in the shower, the prediction for the 
det 5^ distribution can vary by 30%. In the right panel of Fig. [HI we isolate jets within the 
top mass window 160 GeV < Qjet < 200 GeV and compare to a 3 TeV top resonance as in 
Section [2. 3[ Without information about which prediction is closer to true QCD, it is impossible 
to say how effective det 5"*" would be to separate signal from background. 

3.3 W Reconstruction 

The challenge to using event shapes is that they tend to probe the generic properties of jets and 
therefore require good theoretical control over QCD. A more promising approach is to look at 
observables that are very distinctive for boosted tops, such that one can be confident in their 
effectiveness even without accurate QCD predictions. 

For boosted tops, the obvious distinguishing feature is the presence of an on-shell W inside a 
jet. Using a jet clustering algorithm to identify an M — > ABC splitting, one can check whether 
a pair oi A^ B, and C reconstructs an 80 GeV resonance. In the left panel of Fig. [9l we use 
the same clustering procedure as Section 12.31 to identify the M — > ABC splitting, and plot 
the minimum pair-wise invariant mass Qw between A, B, and C. One can see a clear peak at 
80 GeV in the signal, and there is less variation in the different background estimates, though 
one should regard that observation with caution. 

Once a candidate W is identified, one could try to use the same logic as Eqs. ^ and ([3]) to 
test for the QCD soft singularity with the supposed W jet. In the right panel of Fig. [U we plot 
the zw (defined analogously to -Zceii) distribution for events with 60 GeV < Qw < 100 GeV. 
While there is a slight difference in shape, it is not nearly as dramatic as the shape variations 
seen in Fig. \5\ probably because the overlap between subclusters is larger, blurring the QCD 
soft singularity. 

Therefore, we conclude that the best use of hadronic W reconstruction is through a cut 
on Qw- As seen in Fig. [lOl Qw seems to perform as well if not better than Zceii as a probe 
of boosted tops. Unfortunately, we have checked separately that Qw and ZceW have correlated 
rejection rates, so one does not gain much by combining both cuts, though the systematics 
might be different enough to justify using both. In order to fully assess the effectiveness of 
W reconstruction against background, one should also study the background from boosted 
hadronic Ws |29) with additional radiation, which will pass the Qw cut more readily but fail 
the top window and z^eii criteria more often. 
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Figure 9: Left: applying the W reconstruction technique in the top mass window 160 GeV < 
Qjet < 200 GeV. A clear W resonance in Q]y is seen centered at 80 GeV for the top resonance, 
while QCD dijets are peaked at a lower value. Right: applying an additional W window cut 
60 GeV < Qw < 100 GeV and looking at zw, the analog of Zceii within the candidate W. While 
there is some evidence of the QCD soft singularity as zw —f 0, it is not strong enough to justify 
applying additional cuts on this variable. 




Figure 10: Signal efficiency versus background rejection using W reconstruction. Left: effi- 
ciency vs. rejection relative to the 160 GeV < Qjet < 200 GeV top window cut. The Zceii 
distribution is shown for comparison, and the dots indicate the fiducial value {Qmax,Qw} = 
{200 GeV, 60 GeV}. Right: optimizing the Qmax and Qw cuts. This plot is directly comparable 
to Figs. Hand El 
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4 Leptonic Top Decays 
4.1 Stuck Lepton Strategy 

When a top decays leptonically as t — > biv, the neutrino carries away a substantial fraction of 
the top quark energy. It is straightforward to check that in this decay, the range achievable for 



where we have taken the lepton to be massless for simplicity, and nib should be regarded as 
the energy of the b jet including QCD radiationl^l Because this range extends down to very low 
values of Q^isibie' looking for jets with ~ mf is no longer an efficient strategy for boosted 
top identification. Still, we will use Q^jgibie as one selection criteria for boosted leptonic tops. 
We could in principle reconstruct the neutrino momentum and therefore the full kinematics 
in the semileptonic channel of QCD tt production, particularly near the threshold. However, 
depending on the particular top-rich BSM signal one is looking for, such reconstruction may not 
be feasible as there may be additional sources of missing energy in the event. 

Even if one cannot use the invariant mass of visible objects to completely isolate boosted 
leptonic tops, the lepton from a boosted top is not isolated from the top jet hadronic activity, and 
the presence of a hard lepton "stuck" in a jet is still quite unique. One irreducible background 
comes from high-pr bottom and charm jets as the decay of the resulting heavy flavor meson can 
contain leptons. Another irreducible background is W/Z + jets where the gauge boson decays 
leptonically and the lepton "accidentally" gets caught in a jet. There is also a "background" 
to boosted leptonic tops from boosted hadronic tops with leptonic b decays, though the visible 
invariant mass can help separate these signals. 

Because soft-muon tags are already used for 6-tagging [H [5], we will refer to the stuck 
lepton as a muon, though in principle, stuck electrons might also be observable^ While our 
expectation is that muons should be relatively clean, potential fakes must be taken into account. 
Two commons ways muon fakes can happen is kaon or pion decay in flight or if hadronic 
activity punches-through to the muon chamber [33] . In this work, we will not discuss these 
reducible backgrounds from lepton misidentification, though this might be more dangerous than 
the irreducible backgrounds because of the large rate for QCD jets. On the other hand, if the 
fake muon only carries a small fraction of the total hadronic energy, then the muon spectrum 
will be similar to heavy flavor jets, and we will see that we have good control over the heavy 
flavor background. For the numerical results presented in the rest of this section, we have not 
taken into account realistic lepton efficiencies and fake rates, and leave this to future study. 

Unlike the case of hadronic tops, where there were large theoretical uncertainties in the 
jet substructure observables, we expect the leptonic variables to be well-modeled by (tuned) 
Monte Carlo. For the irreducible backgrounds, the source of the leptons are either W bosons 
or B mesons whose decays are well-understood. This leaves two main sources of theoretical 
uncertainty: the invariant mass spectrum of bottom and charm jets and the spectrum of the 
B and D meson inside the jet. The invariant mass spectrum should be measurable from data, 

^In Eq. 111! we have not explicitly included possible missing energy in semileptonic B decays. This is expected 
to a subleading effect in evaluating Qvisibic- We do, however, take this effect into account in our numerical 
simulation. 

^"it is more challenging to identify an electron within a jet, although its different energy deposit pattern on the 
electromagnetic calorimeter combining with tracking information could be used [32j . 
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using a sample of 6-tagged but soft-muon-vetoed jets. One should also have some understanding 
of the B/D meson spectrum from the displaced vertex mass using the same sample. 



4.2 Lepton Variable Study 

To identify a candidate leptonic boosted top, we identify the muon with the hardest pT in 
an event, and use the anti-fcj- algorithm with Rq = 1.0 to figure out which hadronic activity is 
associated with that muon. We call the muon four- vector and the remaining hadronic activity 
Pb, though we emphasize that we do not use 6-tagging in this study and the hadronic activity 
need not initiated by heavy flavor such as the case of the W/Z + jets background. We define 
three leptonic spectrum variables z^, Aii, and and test their effectiveness for background 
rejection: 

~ - AR = AR{p„p,), x,= (12) 



We will describe the reason for these three variables in more detail below. In order to reject 
truly isolated muons, we impose a cut of < 0.9 on all the samples. As in Section [2T3l we are 
focused on the shapes of the backgrounds, and normalize all distributions to 1. 

For the boosted top signal, we will use the same X resonances as Section [2.31 with looser px 
requirements to account for the missing neutrino: 



Resonance Mass 


Pt Cut 


2 TeV 


600 GeV 


3 TeV 


800 GeV 


4 TeV 


1200 GeV 



(13) 



Using Monte Carlo truth information, we further divide the resonance signal into a leptonic 
sample where the muon comes from W decay and a hadronic sample where the muon comes 
from leptonic b decays. We use two different samples to estimate the irreducible backgrounds: 



Name 



Heavy Flavor 
W + Jets 



Description 



Pythia 8. 108 with bb and cc production (14) 
Pythia 8. 108 with W + j production, forcing W — > fiv'n 



Because we only look at the hadronic activity surrounding the hardest muon, the VF+jets sample 
effectively includes the same physics as Z + jets. Similarly, the heavy flavor sample should give 
a reasonable approximation to any heavy flavor jet with the pT cuts in Eq. (|13|) . 

In Fig. [HI we show two-dimension distributions for {Qvisibie, Xfj_} for the 3 TeV top resonance 
and the two background samples, all with an 800 GeV px (including the px of the muon) cut 
on the muonic jet. We plot only for comparison purposes, and will discuss the spectrum 
in more detail below. In order to isolate the boosted leptonic top sample, we want to impose a 
leptonic top mass window Qmin < Qvisibie < Qmax- Since the heavy flavor dijet background is 
peaked at low values of Qvisibio we choose a reasonably hard cut of Qmin = 100 GeV. The effect 
of changing Qmax on efficiency versus rejection is shown in Fig. 1121 and we take Qmax — 

200 GeV 

as a fiducial value. 

Leptonic spectra for the 3 TeV resonance after imposing the 100 GeV < Qvisibie < 200 GeV 
leptonic top window are shown in Fig. [T3j The simplest variable to understand is z^. A lepton 
from a top decay is prompt and therefore carries a large energy fraction of the top system. A 
lepton from a bottom (charm) jet comes only from the decay of a -B (D) meson, but before the 
meson is formed, a significant fraction of the jet energy is already carried by radiated gluons. 
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Figure 11: Two dimensional distributions of (horizontal axis) versus visible mass Qvisibie 
(vertical axis) of the hardest muonic jet with a 800 GeV pT cut (including pT of muon). Top 
left: leptonic top decays. Because of the large neutrino missing energy, Qvisibie peaks below mt- 
As explained in the text, is peaked near 1 because most of the visible mass comes from the 
leptonic-hadronic angular separation. Top right: hadronic top decays with leptonic b decays. 
Since there is little missing energy in leptonic b decays, there is a peak at Qvisibie ~ iT^t- As 
explained in the text, peaks near because most of the visible mass comes from hadronic 
system alone. Bottom left: W + jets production. Just like the leptonic top decay, is peaked 
near 1. Bottom right: Heavy flavor bb and cc production. Just like the hadronic top decay, x^ is 
peaked near 0. The intensity of the shading is proportional to the logarithm of the occupancy. 
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Figure 12: Signal efficiency versus background rejection using a leptonic top mass window 
selection criteria 100 GeV < Qvisibic < Qmax with a varying Qmax- The backgrounds and signals 
are described in Eqs. (fT3]) and Left: heavy flavor background. Right: M^+jets background. 
The dots indicate the fiducial value Qmax = 200 GeV. Because the leptonic top has a wider 
variation in visible mass than the hadronic case, invariant mass alone is not as efficient at 
rejecting background. 

Therefore, is peaked closer to both for heavy flavor jets and for hadronic tops with leptonic 
b decays. For the W + jets sample, peaks near 1 since most of the muons are isolated, but 
some fraction of the time a jet from initial state radiation can accidentally overlap with the 
muon0 The leptonic boosted tops occupies the middle range of and peaks at .4, consistent 
with the expectation that the muon should carry around 1/3 of the visible energyo 

The differences between the four samples are also seen in the angular separation AR between 
the muon and the hadronic activity. For heavy flavor jets, the muon is aligned almost exactly 
along the jet direction, as expected since the muon tracks the B meson which tracks the b quark. 
For hadronic tops with leptonic b decays, there is an additional deflection in AR from recoil 
against the W boson. The AR offset grows larger for leptonic tops since the bulk of the energy 
follows the hadronic activity. For 11/'+ jets the AR is random, as there is no intrinsic correlation 
between the muon direction and the hadronic activity. 

Finally, a powerful variable for separating leptonic tops from heavy flavor jets is x^, which 
measures how much of the visible invariant mass is carried by the hadronic activity. Taking the 
muon to be massless, we can rewrite X ^ as 

2 

i-^/^ = ^r^- (15) 

^visible 

A boosted top has large pT, but the invariant mass of the resulting b jet is expected to be around 
10% of nit since the available phase space is only a fraction of the top mass after sharing energy 
with the W boson, therefore is peaked towards 1. Similarly, VF+jets peaks at x^ = 1 because 
the muon and the jet are uncorrelated and can be at large separation, meaning that 2p^ ■ pi, 
tends to give the dominant contribution in (p^ + Pb)^ ■ For heavy flavor jets, the lepton carries 

^^There is a small peak near — Q for the W + jets sample, because oi W + c production with leptonic D 
decays. 

^^A better estimate accounting for the W mass is zj^'^^^ = (mj + m^)/(3mt — m%/) ~ 0.44. 
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Figure 13: Muonic jet spectra of the hardest muonic jet after applying a leptonic top mass 
window 100 GeV < Qjet < 200 GeV cut, comparing leptonic tops (blue) with hadronic tops 
with leptonic 6s (green), the heavy flavor background (red), and the W + jets background 
(orange). All samples have a jet pT cut of 800 GeV. Prom left to right, top to bottom: visible 
mass Qvisibio, x^, z^, and AR. Descriptions of the behavior of these variables are given in the 
text. 
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Figure 14: Signal efficiency versus background rejection using the various leptonic variables. 
The top row is the heavy flavor background and the bottom row is the W + jets background. 
The left plots are efficiency vs. rejection relative to the 100 GeV < Qvisibie < 200 GeV leptonic 



top window cut. The dots indicate the fiducial values 



200 GeV, x„ = 0.6, and Zn = 0.7. 



The plots in the right column are directly comparable to Fig. [12] and optimize Qr 
to (for heavy flavor) or (for W + jets). 
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a small fraction of the energy, so in order for a heavy flavor jet to have large invariant mass, it 
must come mostly from the hadronic activity, yielding near zero. Hadronic tops are nearly 
identical to heavy flavor jets with the addition of extra hadronic activity from the W decay, so 
it also peaks at = 0. Of course, to use effectively, one needs good momentum resolution 
on the muon, which becomes increasingly difficult at higher pT- 

In Fig. [T4l we summarize the the boosted leptonic top tagging efficiency versus background 
rejection using the three variables z^, AR, and x^. Without taking into account momentum 
resolution issues, it appears that the x^ variable can be used to completely eliminate the heavy 
flavor background. Therefore, using a large x^ cut along with 6-tagging should give a very clean 
boosted leptonic top signal, because the 6-tagging will help reduce the muon fake background. 
For rejecting the W + jets background, the variable gives a factor of 2 to 4 rejection for only 
a small loss in efficiency. It will be interesting to see how and perform to reduce the muon 
fake background in the absence of 6-tagging. We also remark that the fraction of PF+jets events 
in which the lepton actually overlaps with a jet is small since there is no obvious correlation 
between them. 

5 Conclusions and Outlook 

The top quark holds a unique position in both SM and BSM physics. As a SM resonance, it has 
a spectacular three-body decay with multiple distinguishing features like the 6-tagged jet and 
the on-shell intermediate W boson. As a BSM focal point, it often has large couplings to new 
resonances and might be copiously produced in new physics signals. Depending on the mass 
scales involved, though, top quarks from BSM signals could be produced at large boosts, in 
which case the three-body nature of the top quark decay is obscured. While the large invariant 
mass of the top quark helps separate boosted tops from ordinary jets, fat jets are quite common 
at high pt, therefore it is important to find complementary variables to help increase the purity 
of a boosted top signal. 

For boosted hadronic tops, we found that jet substructure observables can give a factor of 2 
to 5 better background rejection over invariant mass alone with only a moderate loss of signal 
efficiency. We classified jet substructure variables into those directly related to a 1 — > 2 splitting 
process in the parton shower approximation and those involving multiple objects. Two-body 
observables are less sensitive to the parton shower modeling, and the energy sharing variable z 
provides additional information about the identity of a fat jet. There is in principle more infor- 
mation in multi-body signatures, and we found that W reconstruction was a promising strategy 
for boosted top identification, however more theoretical studies are needed to understand multi- 
body jet substructure. 

For boosted leptonic tops, one can look for a hard muon surrounded by hadronic activity, and 
we found that the spectrum of the muon relative to the hadronic activity gave excellent rejection 
against the irreducible backgrounds. In particular, the heavy flavor background could be almost 
completely eliminated through the variable, and the W/Z + jets background could be made 
manageable through the variable. However, further study is needed to understand the issue 
of lepton fakes to test whether the spectrum of muons from inflight decay or punch-through is 
sufficiently different from boosted tops. 

In this study, we suggested variables based only on theoretical considerations, and tested 
their effectiveness with minimal realism on calorimeter segmentation. In order to optimize 
these variables and obtain realistic estimates of tagging efficiencies versus background rejection, 
one needs to include more detector effects, such as tracking information, energy/momentum 
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resolution, and fakes. Full optimization of cuts should come after training these variables on 
real jets at the LHC. One important omission is that we did not make use of 6-tagging, which 
should help improve rejection against QCD fat jets and lepton fakes, though the efficiency of 
6-tagging for boosted tops at high px is expected to decrease [34]. 

The results of this paper apply to the properties of individual fat jets, ignoring any other 
features of the event. In many BSM scenarios, the signal is found not just with a single boosted 
top but in a boosted tt channel. Therefore, in addition to the single top tagging observables 
we have studied in this paper, variables which correlate the two top jets should be explored to 
enhance the signal reach. 
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Shortly after this paper was posted, Ref. [36j appeared which presented a complementary 

boosted top strategy. 



In order to understand the QCD soft singularity, we have to understand the expected z distribu- 
tion. For illustrative purposes, we will compare the z distribution between a narrow resonance 
decay and QCD radiation. Because the decay products of a boosted top are generically over- 
lapping, we do not expect a boosted top to look exactly like the narrow width expectation, but 
such a comparison is still instructive. To begin, we will describe two-body phase space variables 
and then compare the differential distributions between the two cases. 

Consider the process pp XAB, where A and B are sub clusters of interest, and X is every 
other final state in the event. We want to approximate the cross section for this process in a 
factorized form as 



where the "mother" parton M is a proxy for the fat jet. If M is a resonance, then such a 
factorized form can be justified in the narrow width approximation, with dfM^AB being a 
differential branching ratio. This factorized form also makes sense in the soft-collinear limit, 
where dfM^AB is the product of a differential splitting function [20] and a Sudakov factor [35] . 

The phase space variables for the "decay" M AB (with masses Qm, Qa, and Qb, respec- 
tively) are 




A The QCD Soft Singularity 



da. 



'pp^XAB = dapp-^xudfu^AB 



(16) 




(17) 



where the on-shell two body phase space is given by 




dVL2 = dcos 9 dcj). 



(18) 



with the usual phase space factor 



KQm;Qa,Qb) = ^(i-qT-^-qT-J 



(19) 
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For unpolarized measurements, the i;^-dependence is uniform, so the phase space of Af — > AB 
is characterized by two independent quantities Qm and cos^. In order to study the QCD soft 
singularity, the natural variables for fat jets are the invariant mass Qm and some energy sharing 
variable z = Ea /Em\^ and in general there will be a Jacobian d cos 6 /dz in the transformation 
from cos 9 to z. 

If z is interpreted strictly as Ea/Em, then two-body kinematics restricts the range for z to 

be 

1 Qi Ql_ ^ _ 



1 + 



-KQmMa^Qb)^ Pm = - Qm/eIj, 

'M 

where Pm is the boost magnitude from the M center-of-mass frame to the lab frame 



(20) 



Because 

these limits depend on and Qb, in a parton shower with multiple emissions, the correct z 
limits on M ^ AB can only be determined after one knows how A and B will split which sets 
the values of Qa and Qb- In particular, a value of z that satisfies Eq. (f20l) for Qa,b = might 
be invalid for Qa,b > 0. There are various ways to deal with this ambiguity [23], and most 
parton showers employ some kind of momentum reshuffling procedure, but it means that the 
interpretation of z in dfM^AB can depend on Qa and in a non-trivial and algorithm-specific 
way. 

We can now compare the differential distributions dfM^AB between the narrow width ap- 
proximation and QCD radiation. In the narrow width approximation, the mother M is exactly 
on-shell: 



NWA 
M^AB 



dQ 



M- 
2 



^AB 



-Br(M ^ AB)6{Q 



M 



(21) 



2tt V2 

where V2 = j d^2 is the volume of two-body Lorentz invariant phase space, which depends on 
the masses of A and B. Note that df^^^ is uniform in cos 9. 

In the soft-collinear QCD case, one can use a parton shower language [23] where the nat- 
ural variables are the evolution variable /i and the energy sharing variable z, both of which 
are functions of {Q\f,cos9}. Unlike the narrow width approximation, the parton M is never 
on-shell, and its off-shellness is determined by the evolution variable fi{Q\.j , cos 9) . Using unpo- 
larized splitting functions defined in terms of the energy sharing variable z{Q\j, cos 9), the QCD 
splitting is described by 



dfM^^AB = d log n'^ ^ dz 



PM^AB{z)A{flsts.rt,fJ-), 



27r 2tt 

where PM^ABiz) are the usual Altarelli-Parisi splitting functions 



f22) 



Pr 



9-^99 



(z) = Cf 
(z) = Ca 



1 + z^ 
1-z 
1 - z 



+ 



Pg-.,g{z) =TR[z^+il 



z 



1 - z 

zf]. 



+ zil-z) 



and A(^start5Ai) is a Sudakov factor [35] 



(23) 



A(;Ustart, Ai) = exp 



AB 



/^start 



d log n' 



27r 



dz 



27r 



P 



M^AB 



(z) 



(24) 



^''in the main body of the text, we define z as min(_EA, Eb)/Em since A and B are indistinguishable. Here, A 
and B have meaningful quantum numbers, so it makes sense to talk about z — Ea/ Em. 
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Note the soft singularities at z — > or z — > 1 which are equivalent to singularities at | cos 9\ ^ 1. 

We want to describe both d/^^"^ and df^^^ in terms of the natural boosted top variables 
Q\j and z. We have not yet specified the functional form of fi and z, and in general, there will be 
a non-trivial Jacobian in the transformation between {fJ-,z} and {Q^,cos6}. Typical evolution 
variables are proportional to invariant mass as iJ? = Q\,jf{z), in which case dlogfi^ = dlogQ'j^. 
Integrating over (p for simplicity 

= ^dz^Br{M AB)6{Ql, - ml), (25) 

df^^Zn = (26) 

where /i is still a function of and z. Assuming that d cos 0/dz does not contain any sin- 
gularities, this function has the form advertised in Eqs. ^ and If Qa = Qb = 0, then 
dcos6/dz = 2/Pm- 

We see that df^'^^ does not factorize into a Ql^-dependent and a z-dependent piece, because 
the as factor and Sudakov factor are written in terms of the evolution variable jj, that mixes 
and z. On the other hand, at the double logarithmic level the value as is fixed, and because the 
definition of /igtart is ambiguous in the Sudakov factor, the value of /igtart can be chosen remove the 
z-dependence up to subleading corrections. In other words, the non-factorizing pieces occur only 
at the single logarithmic level, so d/^*-'^ does factorize in the leading logarithmic approximation. 

To design the perfect z-variable for boosted tops, one would want to figure out which z-like 
variable is maximally orthogonal to for real QCD in the neighborhood of ~ m|. This 
requires understanding the spectrum of Qa and Qb from subsequent splittings and how they 
influence the interpretation of z. Whether this is in principle possible depends on whether the 
leading corrections to df^^^ still respect the factorized form of Eq. (|16p . If higher order QCD 
corrections simply modify the functional form of d/^^^, then there is always some variable z 
that is orthogonal to Q^, though in practice it may be difficult to find an experimental observable 
correlated with z. But if higher order QCD corrections imply that the ideal choice of z variable 
strongly depends on the spectators X in the process pp XAB, then there is no way to improve 
the definition of z. A plausible intermediate possibility is that df^'~'^ depends on X only in some 
limited way such as in the definition of /^start- If ^-like variables are found to be useful in an 
experimental context, then a detailed study of df^''^^ is warranted. 
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